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Edinburgh, U.K. 

Distribution rights by IntelliGenetics, Inc. 

MPsrch_pp protein - protein database search, using 
Smith-Waterman algorithm 


Run on: 
Seconds 


Wed Nov 1 08:32:29 1995; MasPar time 10.28 

476.357 Million cell 


updates/sec 

Tabular output not generated. 

Title: >US-08-121-713B-54 
Description: (1:771) from US08121713B .pep 

Perfect Score: 5765 
Sequence: 1 

MGWLTRIVCLFWGVLLTARA ENKKGRNRRTHE FERAPRS V 771 


Scoring table: 

Searched: 
Database : 


PAM 150 
Gap 11 

53402 seqs, 6354270 residues 


a-geneseq 
1 a-genl 


2 
3 
4 
5 
6 
7 
8 
9 


a-gen2 
a-gen3 
a-gen4 
a-gen5 
a-gen6 
a-gen7 
a-gen8 
a-gen9 


Statistics 


10 a-genlO 

Mean 37.851; Variance 162.907; scale 0.232 


Pred. No. is the number of results predicted by chance to 

have a 

score greater than or equal to the score of the result 
being printed, 

and is derived by analysis of the total score distribution. 


SUMMARIES 


Result Query 

No. Score Match Length DB ID 
Pred. No. 


Description 



1 

108 

1 . 

9 

485 

10 

2 . 

98e+00 







2 

102 

1 . 

8 

240 

3 

7 . 

88e+00 







3 

101 

1 . 

8 

355 

3 

9. 

25e+00 







4 

101 

1 . 

8 

526 

3 

9. 

25e+00 







5 

101 

1 . 

8 

643 

3 

9. 

25e+00 







6 

101 

1 . 

8 

652 

3 

9. 

25e+00 







7 

101 

1 . 

8 

821 

4 

9 . 

25e+00 







8 

101 

1 . 

8 

679 

8 

9. 

25e+00 







9 

100 

1 . 

7 

769 

3 

1 . 

08e+01 







10 

98 

1 # 

7 

726 

2 

1 . 

49e+01 







11 

98 

1 . 

7 

1365 

6 

1 . 

49e+01 







12 

98 

1 . 

7 

726 

3 

1 . 

49e+01 







13 

92 

1 . 

6 

106 

5 

3 . 

77e+01 







14 

92 

1 . 

6 

1257 

9 

3. 

77e+01 







15 

91 

1 . 

6 

468 

1 

4 . 

39e+01 







16 

91 

1 . 

6 

460 

. 4 

4 . 

39e+01 







17 

90 

1 . 

6 

691 

7 

5 . 

lle+01 







18 

90 

1 . 

6 

403 

1 

5 . 

lle+01 







19 

90 

1 . 

6 

403 

8 

5 . 

lle+01 







20 

89 

1 . 

5 

498 

3 

5 . 

94e+01 







21 

89 

1 . 

5 

380 

3 

5 . 

94e+01 







22 

88 

1 . 

5 

131 

5 

6. 

91e+01 







23 

87 

1 . 

5 

118 

10 

8 . 

02e+01 







24 

87 

1 . 

5 

102 

10 

8. 

02e+01 







25 

87 

1. 

5 

922 

10 

8. 

02e+01 







26 

87 

1. 

5 

118 

10 

8. 

02e+01 







27 

87 

1. 

5 

816 

3 


R55585 

AmEPV NPH-1 

J. -U.Lt.J_l X V in J. 11 _ . • 

R15263 

Extracellular domain 

R15264 

Extracellular domain 

R15266 

Clone pTBl228-encoded 

Rl 5267 

X\ _L — ) ^ \J 1 

Clone rjTB122 9-encoded 

R15269 

Clone toTB12 83-encoded 

R21080 

f Ict r erector nrot ein 

R41 51 7 

i\ o CllLL • 

R15268 

X \ _l_ ij \J \J 

Clone DTB12 8 4 -encoded 

R10933 

KGF receptor . 

R167R0 

KRF S 

R14280 

Murine KGF receotor. 

x x x> ^l. x x v x v x> x_ \^ y*/ v** \*/ x> * 

r\c \j ~s *j 

Hlirnqri T 1 1 \/mTltl Pi r* \/t P TP 

R46627 

Neurocan core orotein 

P90525 

B cell stimulating fa 

R22616 

IL- 6R for soluble IL- 

■X. X_J \J X V -1— V-/ JL_ v_/ J— l_XX_i* -JL. V^r -i- JLJ 

R38735 

X\ ~-> U i J J 

Rpta — c/al rirto^i qp 

XJ L- CI UQlUU LUOl . ClU O V_ » 

X Jl 1 U J 

kd rprrrp^^i nn-^i ^^n 

R40855 

3 8 kd r ecxr e s s ion as ^or 

Rl 3270 

Ti\7m"n"h n c\/t p Activpitirin 

R13272 

Pol v"h pdri Ti-c;nl nhl p TiA 

XT UX y lit ^wiX. _L XX O \J -L UJJ -L \3 XJxi. 

R25579 

Tnonl in nrprnr^nr MT? 

X llO LX _L _L X X J- \ UL X O W X 111 J 

L\^J 1 ~J \J 

1^11 LJT\±\ X Jv^ 1 COIUUCO U\J 

R57333 

NMDAR1-4 residues 803 

R55531 

Human NMDA Rid recept 

R57329 

NMDAR1-3A residues 80 

R14444 

A.faecalis penicillin 


8.02e+01 



28 

87 

1 

.5 

911 

2 

R10333 

Deduced sequence of t 

8. 

02e+01 









29 

87 

1 

.5 

938 

10 

R55532 

Human NMDA Rla recept 

8 . 

02e+01 









30 

87 

1 

. 5 

920 

9 

R49043 

NMDA receptor channel 

8. 

02e+01 








31 

87 

1 

. 5 

920 

9 

R49044 

NMDA receptor channel 

8. 

02e+01 








32 

86 

1 

.5 

386 

9 

R51061 

Sequence encoded by t 

9. 

30e+01 








33 

86 

1 

. 5 

283 

8 

R45455 

Theileria sergenti ma 

9. 

30e+01 









34 

86 

1 

. 5 

886 

1 

P80345 

Sequence encoded by S 

9. 

30e+01 








35 

85 

1 

.5 

344 

1 

P90528 

B cell stimulating fa 

1. 

08e+02 









36 

85 

1 

.5 

460 

2 

P81061 

Sequence of rhinoviru 

1. 

08e+02 








37 

85 

1 

.5 

2164 

1 

P80131 

Peptides translated f 

1. 

08e+02 








38 

85 

1 

.5 

3685 

1 

P90290 

Human Duchenne muscul 

1. 

08e+02 









39 

85 

1 

.5 

468 

3 

P90284 

Sequence of a recepto 

1. 

08e+02 









40 

85 

1 

. 5 

323 

1 

P90527 

B cell stimulating fa 

1. 

08e+02 








41 

85 

1 

. 5 

3685 

3 

P90373 

Sequence encoded by h 

1. 

08e+02 









A O 

42 

85 

1 

. 5 

2164 

2 

P81045 

Sequence of the viral 

1. 

08e+02 









43 

85 

1 

.5 

110 

5 

R26951 

Human T lymphocyte re 

1. 

08e+02 









44 

85 

1 

.5 

247 

8 

R42382 

H. somnus lppA. 

1. 

08e+02 









45 

85 

1 

.5 

265 

3 

R12844 

HTLV-1 protein expres 


1.08e+02 


ALIGNMENTS 

RESULT 1 

ID R55585 standard; Protein; 485 AA. 

AC R55585; 

DT 18-JAN-1995 (first entry) 

DE AmEPV NPH-1. 

KW. Spheroidin; gene expression; vector; insect cell culture; 

KW mammal cell culture; AmEPV; NPH-1; 

KW nucleoside-triphosphate-phosphohydrolase . 

OS Amsacta moorei entomopoxvirus . 

PN W09413812-A. 

PD 23-JUN-1994. 

PF 07-DEC-1993; U11907. 

PR 07-DEC-1992; US-991867. 


PA (UYFL ) UNIV FLORIDA. 

PI Gruidl ME, Hall RL, Moyer RW; 

DR WPI; 94-217887/26. 

PT New entomopoxvirus polynucleotide sequences, proteins and 
vectors 

PT - are used for expression of heterologous proteins in both 
insect 

PT and mammalian host cells 

PS Disclosure; Page 85-86; 118pp; English. 

CC The AmEPV spheroidin gene was isolated and sequenced. 

Mapping 

CC showed the gene to be located at the 3 f end of a nucleoside- 
CC triphosphate-phosphohydrolase gene (NPH-1) . The sequence of 
the 

CC NPH-1 gene is given in Q66824, and its encoded amino acid 
sequence 

CC in R55585. The spheroidin gene can be used as the location 
for 

CC insertion of heterologous DNA in insect and mammalian 

expression 

CC systems. 

SQ Sequence 4 85 AA; 

DB 10; Score 108; Match 23.9%; QryMatch 1.9%; Pred. No. 

2.98e+00; 

Matches 16; Conservative 26; Mismatches 21; Indels 
4; Gaps 4; 

Db 241 

sykrlieadsltetnyidgyakknif yhniimsdeqsklynmaekydyktelgglktmrr 300 


S YKEMLESNNVI - T- F-NGLANS S S - YHT FLLDEERSRLYVGAKDHI FS FDLVNI KDFQK 90 
Db 301 lissfaf 307 
Qy 91 IVWPVSY 97 

Search completed: Wed Nov 1 08:32:56 1995 
Job time : 27 sees. 


Release 2. ID John F. Collins, Biocomputing Research 

Unit. 

Copyright (c) 1993, 1994, 1995 University of 
Edinburgh, U.K. 

Distribution rights by IntelliGenetics, Inc. 

MPsrchjpp protein - protein database search, using 
Smith-Waterman algorithm 

Run on: Wed Nov 1 08:31:01 1995; MasPar time 27.72 

Seconds 

663.261 Million cell 

updates/sec 

Tabular output not generated. 

Title: MJS-08-121-713B-54 
Description: (1:771) from US08121713B .pep 

Perfect Score: 5765 
Sequence: 1 

MGWLTRIVCLFWGVLLTARA ENKKGRNRRTHE FERAPRS V 771 

Scoring table: PAM 150 

Gap 11 

Searched: 78488 seqs, 23849247 residues 

Database: pir45 


1 

ANNOl 

2 

ANN02 

3 

ANNO 3 

4 

UNANNOl 

5 

UNANN02 

6 

UNANN03 

7 

UNANN04 

8 

UNANN05 

9 

UNANN06 

10 

UNANN07 

11 

UNREV1 

12 

UNREV2 


Statistics: Mean 50.575; Variance 117.403; scale 0.431 

Pred. No. is the number of results predicted by chance to 

have a 

score greater than or equal to the score of the result 
being printed, 

and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
Pred. No. 



1 

5765 

100. 

0 

771 

10 

0. 

00e+00 







2 

5278 

91. 

6 

772 

9 

0. 

00e+00 







3 

917 

15. 

9 

730 

9 

8. 

14e-133 







4 

804 

13. 

9 

656 

9 

7. 

38e-113 







5 

799 

13. 

9 

724 

9 

5. 

59e-112 







6 

778 ' 

13. 

5 

711 

12 

2. 

73e-108 







7 

158 

2. 

7 

441 

6 

1. 

46e-06 







8 

158 

2. 

7 

403 

6 

1. 

46e-06 







9 

147 

2. 

5 

295 

6 

3. 

85e-05 







10 

116 

2. 

0 

122 

6 

2. 

01e-01 







11 

116 

2. 

0 

122 

6 

2 . 

01e-01 







12 

114 

2. 

0 

2733 

4 

3. 

35e-01 







13 

114 

2. 

0 

1174 

4 

3. 

35e-01 







14 

110 

1. 

9 

2731 

1 

9. 

15e-01 







15 

110 

1. 

9 

501 

1 

9. 

15e-01 







16 

108 

1. 

9 

1260 

6 

1. 

50e+00 







17 

108 

1. 

9 

485 

4 

1. 

50e+00 







18 

108 

1 . 

9 

648 

1 

1. 

50e+00 







19 

107 

1 . 

9 

1255 

6 

1. 

91e+00 







20 

107 

1. 

9 

1259 

6 

1. 

91e+00 







21 

104 

1 . 

8 

469 

9 

3. 

94e+00 







22 

103 

1. 

8 

254 

9 

4. 

99e+00 







23 

103 

1. 

8 

713 

9 

4. 

99e+00 







24 

102 

1. 

8 

823 

4 

6. 

32e+00 







25 

102 

1. 

8 

824 

12 

6. 

32e+00 







26 

102 

1. 

8 

873 

9 


D49423 

semaphorin III precu 

A49069 

collapsin - chicken 

JH0798 

fasciclin IV precurs 

B49423 

semaphorin I - fruit 

C49423 

semaphorin II precur 

A49423 

semaphorin I precurs 

S29921 

hypothetical protein 

E42521 

A39R protein — vacci 

JQ1775 

SalL9R protein - vac 

H36852 

A43R protein - vario 

JQ1845 

14R protein - variol 

S15760 

RNA-directed RNA pol 

A39927 

RNA-ciirected RNA pol 

VFIHJH 

RNA-directed RNA pol 

PWPMA 

H+-transporting ATP 

S05479 

neural cell adhesion 

A44279 

spheroidin - Amsacta 

NPVZCP 

nucleoside- triphosph 

S17655 

neural cell adhesion 

S36126 

neural cell adhesion 

A25376 

SPS2 protein - yeast 

C42691 

fibroblast growth fa 

S42803 

fibroblast growth fa 

B35963 

protein- tyrosine kin 

S24108 

protein- tyro sine kin 

B41054 

fasciclin II Pi-link 


6. 

32e+00 









27 

102 

1 . 

8 

813 

4 

S25060 

fibroblast growth fa 

6. 

32e+00 









28 

102 

1 . 

8 

806 

4 

A35963 

protein- tyrosine kin 

6. 

32e+00 









29 

102 

1 . 

8 

811 

9 

A41054 

fasciclin II, transm 

6. 

32e+00 









30 

102 

1. 

8 

174 

6 

JQ1611 

nonstructural protei 

6. 

32e+00 









31 

102 

1. 

8 

538 

6 

S22167 

neural cell adhesion 

6. 

32e+00 









32 

102 

1 . 

8 

174 

3 

JQ1564 

nonstructural protei 

6. 

32e+00 









33 

102 

1. 

8 

813 

4 

A49123 

fibroblast growth fa 

6. 

32e+00 









34 

101 

1. 

8 

705 

9 

B42691 

fibroblast growth fa 

7. 

99e+00 









35 

101 

1. 

8 

682 

4 

A35969 

heparin-binding grow 

7. 

99e+00 









36 

101 

1. 

8 

822 

4 

A45081 

fibroblast growth fa 

7. 

99e+00 









37 

101 

1 . 

8 

822 

4 

A41794 

keratinocyte growth 

7. 

99e+00 









38 

101 

1 . 

8 

822 

1 

TVHUF2 

fibroblast growth fa 

7. 

99e+00 









39 

101 

1 . 

8 

769 

4 

S16236 

fibroblast growth fa 

7. 

99e+00 









40 

101 

1 . 

8 

821 

4 

A42691 

fibroblast growth fa 

7. 

99e+00 









41 

100 

1 . 

7 

490 

4 

A32140 

steroid 15beta-monoo 

1 . 

01e+01 









42 

100 

1 . 

7 

1898 

10 

A45973 

trichohyalin - human 

1. 

01e+01 









43 

99 

1. 

7 

811 

7 

S39901 

nwsA protein - Brady 

1. 

27e+01 









44 

99 

1. 

7 

811 

7 

S32683 

nwsA protein - Brady 

1. 

27e+01 









45 

99 

1. 

7 

197 

9 

A29648 

f emale- specif ic tran 

1. 

27e+01 









ALIGNMENTS 


RESULT 1 

ENTRY 

TITLE 

ORGANISM 

DATE 

#text_change 

ACCESSIONS 

REFERENCE 
#authors 
# journal 


D49423 #type complete 

semaphorin III precursor - human 
#formal_name Homo sapiens #common_name man 
06-Jan-1995 #sequence_revision 06-Jan-1995 

27-Jan-1995 
D49423 
A49423 

Kolodkin, A.L.; Matthes, D.J.; Goodman, C.S 
Cell (1993) 75:1389-1399 


#title The Semaphorin genes encode a family of 

transmembrane and 

secreted growth cone guidance molecules. 
#accession D49423 

##status preliminary 
##molecule_type mRNA 
##residues 1-771 ##label KOL 

##cross-references GB:L26081 

##note nucleotide sequence not given 

SUMMARY #length 771 #molecular-weight 88889 #checksum 

6249 

DB 10; Score 5765; Match 100.0%; QryMatch 100.0%; Pred. 
No. 0.00e+00; 

Matches 771; Conservative 0; Mismatches 0; Indels 
0; Gaps 0; 

RESULT 2 

ENTRY A49069 #type complete 

TITLE collapsin - chicken 

ORGANISM #formal_name Gallus gallus #common_name chicken 

DATE 07-Apr-1994 #sequence_revision 07-Apr-1994 

#text_change 

07-Apr-1994 
ACCESSIONS A49069 
REFERENCE A4 90 69 

ttauthors Luo, Y.; Raible, D.; Raper, J. A. 

#journal Cell (1993) 75:217-227 

tttitle Collapsin: a protein in brain that induces the 

collapse and 

paralysis of neuronal growth cones. 
#accession A49069 

##status preliminary; not compared with conceptual 

translation 

##molecule_type mRNA 
##residues 1-772 ##label LUO 

iicross-references GB:U02528 
SUMMARY #length 772 #molecular-weight 88867 #checksum 

9712 

DB 9; Score 5278; Match 88.5%; QryMatch 91.6%; Pred. 
No. 0.00e+00; 

Matches 683; Conservative 64; Mismatches 24; Indels 
1; Gaps 1; 

Db 1 

mgwlrgiallslgvllagrvncqhvknnvprlklsykemlesnnivnf nglansssyhtf 60 

I I I I I I I I I I :: I I I : I I I I II I I I I I I I I I I I I I : : 
I I I I I I I I I I I I I 
Qy 1 

MGWLTRIVCLFWGVLLTARANYQNGKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTF 60 
Db 61 

lldeersrlyvgakdhif sfnlvnikeyqkivwpvshsrrdeckwagkdilrecanf ikv 120 


I I I I I I I I I I I I I I I I I I I I : I I I II : : I I I I I I I I 
: I I I I I I I I I I I I I : I I I I I I I I 
Qy 61 

LLDEERSRLYVGAKDHIFSFDLVNIKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKV 120 
Db 121 

lktynqthlyacgtgafhpmctyievgshpednif rmedshf engrgkspydpklltasl 180 

I I : I I I I I I I I I I I I I I I I : II I I I : I 
I I I I I I I : : I : I I I I I I I I I I I I I I I I I I I I I 
Qy 121 

LKAYNQTHLYACGTGAFHPICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASL 180 
Db 181 

lvdgelysgtaadfmgrdf aif rtlghhhpirteqhdsrwlndprf isahlipesdnped 240 

I : I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I 

Qy 181 

LIDGELYSGTAADFMGRDFAIFRTLGHHHPIRTEQHDSRWLNDPKFISAHLISESDNPED 240 
Db 241 

dkiyf f f renaidgehtgkatharigqickndf gghrslvnkwttf lkarlicsvpgpng 300 

I I : I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I L. I I I I I 

Qy 241 

DKVYFFFRENAIDGEHSGKATHARIGQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNG 300 
Db 301 

idthf delqdvf lmnskdpknpivygvf ttssnif kgsavcmysmtdvrrvf lgpyahrd 360 

I I I I I I I I I I I I I I I 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I 

Qy 301 

IDTHFDELQDVFLMNFKDPKNPWYGVFTTSSNIFKGSAVCMYSMSDVRRVFLGPYAHRD 3 60 
Db 361 

gpnyqwvpyqgrvpyprpgtcpsktf ggfdstkdlpdevitf arshpamynpvfpinsrp 420 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I : I I I I I I I I I II I I I I I I : I : I I 
Qy 361 

GPNYQWVPYQGRVPYPRPGTCPSKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRP 420 
Db 421 

imiktdvdyqf tqivvdrvdaedgqydvmf igtdigtvlkvvsipketwheleevlleem 4 80 
I : I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I 

: I I II I I I I I 
Qy 421 

IVIKTDVNYQFTQIWDRVDAEDGQYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEM 480 
Db 481 

tvf reptvisamkistkqqqlyigsatgvsqlplhrcdvygkacaecclardpycawdgs 540 
I I I I I I I I I I I 

: I I I I I I I I I I I :: I I : I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 
Qy 481 

TVFREPTAISAMELSTKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAWDGS 540 


Db 541 

scsryfptakrrtrrqdirngdplthcsdlqhhdnpsgqtleekiiygvensstf lecsp 600 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : 
I I : I I I I I I I I I I I I I I I I 
Qy 541 

ACSRYFPTAKRRTRRQDIRNGDPLTHCSDL-HHDNHHGHSPEERIIYGVENSSTFLECSP 599 
Db 601 

ksqraivywqf qkqnddhkveikvddrmirteqglllrslqrrdsgiyf chavehgf iqt 660 

I I I I I : I I I I I I : : I : : : I I I : I I I : : II I : I I I I I I I I I : : I I I 
I : I I I I I I I I I I I 
Qy 600 

KSQRALVYWQFQRRNEERKEEIRVDDHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFIQT 659 
Db 661 

llkvtlevidtdhleellhkeedadasktkdatnsmtpsqkiwyrdfmqlinhpnlntmd 720 

I I I I I I I I I I I : I I I I I I II : : I : I : I I I I : 
: I II I I I I I : I I I I I I I I I I I I I I I I II 
Qy 660 

LLKVTLEVIDTEHLEELLHKDDDGDGSKTKEMSNSMTPSQKVWYRDFMQLINHPNLNTMD 719 

Db 721 ef ceqvwkrdrkqrrqrpanaqvntnkwkhlqenkkgrnrrthef eraprsv 
772 

I I I I I I I I I I I I I I I I I I : : : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Qy 720 EFCEQVWKRDRKQRRQRPGHTPGNSNKWKHLQENKKGRNRRTHEFERAPRSV 

771 


RESULT 3 
ENTRY 
TITLE 

grasshopper 
ORGANISM 
American 

DATE 

#text_change 

ACCESSIONS 
REFERENCE 

#authors 
Patel, N.H.; 

#journal 
#title 
during 

growth cone guidance in the grasshopper 

embryo. 

#accession JH0798 

##molecule_type mRNA 

##residues 1-730 ##label KOL 

##cross-ref erences GB:L00709 

##experimental_source embryo 
COMMENT This protein plays a role in growth cone guidance in 


JH0798 #type complete 

fasciclin IV precursor - American bird 

#formal_name Schistocerca americana #common_name 

bird grasshopper 
30-Sep-1993 #sequence_revision 30-Sep-1993 

ll-Apr-1995 
JH0798 
JH0798 

Kolodkin, A.; Matthes, D.J.; O'Connor, T.P.; 

Admon, A.; Bent ley, D.; Goodman, C.S. 
Neuron (1992) 9:831-845 

Fasciclin IV: Sequence, expression, and function 


the developing 

central nervous system. 
KEYWORDS glycoprotein; transmembrane protein 

FEATURE 

1-22 #domain signal sequence #status predicted 

#label SIG\ ' 

23-730 #product fasciclin IV #status predicted 

#label MAT\ 

23-627 #domain extracellular #label EXT\ 

628-652 #domain transmembrane #label TMM\ 

653-730 #domain intracellular #label INT\ 

44,71, 163,267, 360, 

539 #binding_site carbohydrate (Asn) (covalent) 

#status 

predicted 

SUMMARY #length 730 #molecular-weight 81214 #checksum 

5881 

DB 9; Score 917; Match 34.8%; QryMatch 15.9%; Pred. 

No. 8.14e-133; 

Matches 174; Conservative 121; Mismatches 167; Indels 
38; Gaps 27; 

Db 42 

lgneshkdhf kllekdhnsllvgarnivynislrdltef teqriewhssgahrelcylkg 101 
I : I I | | | : : : : | | | | : : : : : : | : : : | I : I I I : : I : 

I I 

Qy 51 

LANSSSYHTF-LLDEERSRLYVGAKDHIFSFDLVNIKDF--QKIVWPVSYTRRDECKWAG 107 
Db 102 

kse-ddcqnyirvlakidddrvlicgtnaykplcrhyal — k-dgd-yvvek-eye-grg 154 
I : I I : I : I I : : : : I I I I : I : I : : : : : I : 

: I III 

Qy 108 

KDILKECANFIKVLKAYNQTHLYACGTGAFHPICTYIEIGHHPEDNIFKLENSHFENGRG 167 
Db 155 

lcpf dpdhnstaiysegqlysatvadf sgtdpliyr — g plrtersdlkqlnapnfv 209 

1:11 : : : : : I : I I I : I I I I I I I : I I I : I I I : I : I I 

I : I : 

Qy 168 

KSPYDPKLLTASLLIDGELYSGTAADFMGRDFAIFRTLGHHHPIRTEQHDSRWLNDPKFI 227 
Db 210 

nt-m — ey-n df if f f f retaveyincgkaiysrvarvckhdkggphqf gdrwtsf 1 2 62 

: : : I I I : : I I I I I I : : : I I I : I : : : : I I : I I I : : 

: : I I : I I 
Qy 228 

SAHLISESDNPEDDKVYFFFRENAIDGEHSGKATHARIGQICKNDFGGHRSLVNKWTTFL 2 87 
Db 2 63 

ksrlncsvpgdypf yf neiqstsdiiegnyggqvekliygvf ttpvnsiggsavcaf smk 322 
1:1111111 : : : I : : I : : : : I I I I II : I : I I I I I 


Qy 288 

KARL I CSVPGPNGI D-THFDELQDVFLMNFKDPKNPWYGVFTTS SNI FKGSAVCMYSMS 346 
Db 323 

silesfdgpf keqetmnsnwlavpslkvpeprpgqc v — n-ds-rtlpdvsvnfvks 375 


Qy 347 

DVRRVFLGPYAHRDGPNYQWVPYQG-RVPYPRPGTCPSKTFGGFDSTKDLPDDVITFARS 405 
Db 37 6 

htlmdeavpaf f trpilirislqyrf tkiavdqqvrtpdgkaydvlf igtddgkvikaln 435 
I I : : I : I I I : I : : I : I I I I I : I : I I I I I : I I I I I I 

I : I : : 
Qy 406 

HPAMYNPVFPMNNRPIVIKTDVNYQFTQIWDR-VDAEDGQ-YDVMFIGTDVGTVLKWS 4 63 
Db 436 

sasf dssdtvdsvvieelqvlppgvpvknlyvvrmdgddsklvvvsddeilaiklhrcgs 495 


Qy 464 

IPK-ETWYDLEEVLLEEMTVFR-E-PT-AISAMELSTKQQQLYIGSTAGVAQLPLHRCDI 519 

Db 496 dkitncrecvslqdpycawd 515 

I II : I I I I I I I 
Qy 520 YG-KACAECCLARDPYCAWD 538 

Search completed: Wed Nov 1 08:32:08 1995 
Job time : 67 sees. 


Release 2 . ID John F. Collins, Biocomputing Research 

Unit. 

Copyright (c) 1993, 1994, 1995 University of 
Edinburgh, U.K. 

Distribution rights by IntelliGenetics, Inc. 

MPsrch_pp protein - protein database search, using 
Smith-Waterman algorithm 


Run on : 
Seconds 


Wed Nov 1 08:30:06 1995; MasPar time 16.72 

707.202 Million cell 


updates/sec 

Tabular output not generated. 

Title: >US-08-121-713B-54 
Description: (1:771) from US08121713B. pep 

Perfect Score: 5765 
Sequence: 1 

MGWLTRIVCLFWGVLLTARA ENKKGRNRRTHEFERAPRSV 771 


Scoring table: 

Searched: 
Database : 


PAM 150 
Gap 11 

43470 seqs, 15335248 residues 

swiss-prot31 
1 SPT1 


2 
3 
4 
5 
6 
7 
8 


SPT2 
SPT3 
SPT4 
SPT5 
SPT6 
SPT7 
SPT8 


Statistics 


Mean 52.613; Variance 88.879; scale 0.592 


Pred. No. is the number of results predicted by chance to 

have a 

score greater than or equal to the score of the result 
being printed, 

and is derived by analysis of the total score distribution. 

SUMMARIES 


Result Query 

No. Score Match Length DB ID 
Pred. No. 


Description 



1 

158 

2 

. 7 

9. 

12e-10 





2 

158 

2 

. 7 

9. 

12e-10 





3 

114 

2 

. 0 

1 . 

21e-02 





4 

110 

1 

. 9 

4. 

50e-02 





5 

110 

1 

. 9 

4 . 

50e-02 





6 

108 

1 

. 9 

8. 

59e-02 





7 

108 

1 

. 9 

8. 

59e-02 





8 

107 

1 

. 9 

1 . 

18e-01 





9 

104 

1 

. 8 

3. 

03e-01 





10 

102 

1 

. 8 

5. 

61e-01 





11 

102 

1 

. 8 

5. 

61e-01 





12 

102 

1 

. 8 

5. 

61e-01 





13 

102 

1 

.8 

5. 

61e-01 





14 

102 

1 

.8 

5. 

61e-01 





15 

102 

1 

. 8 

5. 

61e-01 





16 

101 

1 

. 8 

7. 

61e-01 





17 

101 

1 

.8 

7. 

61e-01 





18 

100 

1 

.7 

1. 

03e+00 





19 

100 

1 

.7 

1. 

03e+00 





20 

99 

1 

. 7 

1 . 

39e+00 





21 

98 

1 

. 7 

1 . 

86e+00 





22 

98 

1 

. 7 

1 . 

86e+00 





23 

98 

1 

. 7 

1. 

86e+00 





24 

97 

1 

. 7 

2. 

49e+00 





25 

96 

1 

. 7 

3. 

33e+00 





26 

96 

1 

.7 

3. 

33e+00 





27 

96 

1 

.7 

3. 

33e+00 





403 

7 

VA39_ 

_VACCC 

441 

7 

VA39_ 

_VACCV 

2733 

6 

RRPB_ 

_CVMA5 

501 

1 

ATPA_ 

_PEA 

2731 

6 

RRPB_ 

_CVMJH 

1260 

1 

CAML_ 

_MOUSE 

648 

5 

NTP1_ 

_CBEPV 

1259 

1 

CAML_ 

RAT 

469 

7 

SPS2_ 

_YEAST 

823 

2 

CEK3_ 

_CHICK 

174 

8 

VNSC_ 

_PHODV 

811 

3 

FS22_ 

_DROME 

873 

3 

FS21_ 

_DROME 

806 

2 

CEK2_ 

_CHICK 

813 

3 

FGR2_ 

XENLA 

654 

1 

BFR2_ 

_HUMAN 

821 

3 

FGR2_ 

HUMAN 

490 

2 

CPCC_ 

_RAT 

1898 

7 

TRHY_ 

_HUMAN 

197 

7 

TRSF_ 

DROME 

1088 

6 

RRPO_ 

ROTPG 

707 

4 

KGFR_ 

_MOUSE 

1365 

4 

KRE5_ 

_YEAST 

701 

7 

TP20 

_YEAST 

859 

3 

ENV_EIAV3 

160 

5 

NCAP_ 

BEV 

239 

6 

RPSE_ 

_BACSU 


PROTEIN A39. 
PROTEIN A3 9. 
RNA-DIRECTED RNA POLY 
ATP SYNTHASE ALPHA CH 
RNA-DIRECTED RNA POLY 
NEURAL CELL ADHESION 
NUCLEOSIDE TRIPHOSPHA 
NEURAL CELL ADHESION 
SPORULATION-SPECIFIC 
TYROSINE KINASE RE CEP 
NONSTRUCTURAL PROTEIN 
FASCICLIN II, PHOSPHA 
FASCICLIN II, MEMBRAN 
TYROSINE KINASE RE CEP 
FIBROBLAST GROWTH FAC 
FIBROBLAST GROWTH FAC 
FIBROBLAST GROWTH FAC 
CYTOCHROME P4 50 IIC12 
TRICHOHYALIN. 
FEMALE- SPECIFIC TRANS 
RNA-DIRECTED RNA POLY 
KERAT INOCYTE GROWTH F 
KILLER TOXIN-RES I STAN 
PROTEIN TRANSPORT PRO 
ENV POLYPROTEIN PRECU 
NUCLEOCAPSID PROTEIN. 
RNA POLYMERASE SIGMA- 



28 

95 

1 

. 6 

761 

5 

NCA2 

HUMAN 

NEURAL CELL ADHESION 

4 . 

44e+00 










29 

95 

1 

. 6 

761 

6 

RIR1 

SALTY 

RIBONUCLEOS IDE-DIPHOS 

4 . 

44e+00 










30 

95 

1 

. 6 

393 

6 

RPOC 

HALMO 

DNA-DIRECTED RNA POLY 

4. 

44e+00 










31 

95 

1 

. 6 

1407 

7 

TRHY_ 

_RABIT 

TRICHOHYALIN. 

4 . 

44e+00 










32 

95 

1 

. 6 

503 

1 

ATPA_ 

_SYNP1 

ATP SYNTHASE ALPHA CH 

4 . 

44e+00 










33 

95 

1 

. 6 

859 

3 

ENV EI AVI 

ENV POLYPROTEIN PRECU 

4. 

44e+00 










34 

94 

1 

. 6 

114 

4 

HV2A_ 

_RABIT 

IG HEAVY CHAIN V-A2 R 

5. 

90e+00 










35 

94 

1 

. 6 

1098 

6 

RPOP_ 

_MAIZE 

PROBABLE DNA-DIRECTED 

5. 

90e+00 










36 

94 

1 

. 6 

797 

5 

PHSM_ 

_ECOLI 

MALTODEXTRIN PHOSPHOR 

5. 

90e+00 










37 

94 

1 

. 6 

1115 

5 

NCA1_ 

_MOUSE 

NEURAL CELL ADHESION 

5. 

90e+00 










38 

94 

1 

. 6 

725 

5 

NCA3_ 

_MOUSE 

NEURAL CELL ADHESION 

5. 

90e+00 










39 

94 

1 

. 6 

1031 

4 

KINH_ 

_STRPU 

KINESIN HEAVY CHAIN. 

5. 

90e+00 










40 

94 

1 

. 6 

182 

8 

YANC_ 

_PSEAE 

HYPOTHETICAL 19.8 KD 

5. 

90e+00 










41 

94 

1 

. 6 

513 

5 

PHSL_ 

_DESBA 

PERIPLASMIC [NIFESE] 

5. 

90e+00 










42 

93 

1 

. 6 

382 

5 

MSN1_ 

_YEAST 

MSN1 PROTEIN (MULTICO 

7 . 

81e+00 










43 

93 

1 

. 6 

859 

3 

ENV EIAV2 

ENV POLYPROTEIN PRECU 

7. 

81e+00 










44 

92 

1 

.6 

156 

8 

Y01F_ 

_BPT4 

HYPOTHETICAL 18.3 KD 

1. 

03e+01 










45 

92 

1 

.6 

68 

4 

HSP1_ 

_TACAC 

SPERM PROTAMINE PI (C 

1. 

03e+01 










ALIGNMENTS 

RESULT 1 

ID VA39_VACCC STANDARD; PRT; 4 03 AA. 

AC P21062; 

DT 01-FEB-1991 (REL. 17, CREATED) 

DT 01 -FEB- 19 91 (REL. 17, LAST SEQUENCE UPDATE) 

DT 01-FEB-1991 (REL. 17, LAST ANNOTATION UPDATE) 

DE PROTEIN A3 9. 

GN A39R. 

OS VACCINIA VIRUS (STRAIN COPENHAGEN) . 

OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; POXVIRIDAE; 

CHORDOPOXVIRINAE ; 

OC ORTHOPOXVIRUSES . 

RN [1] 

RP SEQUENCE FROM N.A. 


RM 91021027 

RA GOEBEL S.J., JOHNSON G.P., PERKUS M.E., DAVIS S.W., WINSLOW 
J.P. , 

RA PAOLETTI E. ; 

RL VIROLOGY 179:247-266 (1990) . 

RN [2] 

RP COMPLETE GENOME. 

RA GOEBEL S.J., JOHNSON G.P., PERKUS M.E., DAVIS S.W., WINSLOW 
J. P., 

RA PAOLETTI E.; 

RL VIROLOGY 179: 517-563 (1990) . 

DR EMBL; M35027; PXVACCG. 

DR PIR; E42521; E42521. 

SQ SEQUENCE 403 AA; 45741 MW; 907305 CN; 

DB 7; Score 158; Match 32.7%; QryMatch 2.7%; Pred. No, 
9.12e-10; 

Matches 37; Conservative 29; Mismatches 33; Indels 

14; Gaps 10; 

Db 170 

dkvyilf td-tigskrivkipy — iaqmclndeggpsslsshrwstf lkvelecdi d 223 

I I I I : : I : : I : : I I : I : I I I I I II : : I : I I I I II 

Qy ' 241 

DKVYFFFRENAIDGEHSGKATHARIGQICKNDFGGHRSLV-NKWTTFLKARLICSVPGPN 299 

Db 224 gr-s-yrqi — ihsrtiktdndtily-vf f dspys-k-salctysmntikqsf 

269 

I : : : : : : I : : : : I I I I : I I I : I I I I : : : : I 

Qy 300 GIDTHFDEL QDVFLMNFKD P KNP WYGVFT T S S N I FKG S AVCM Y SMS D VRRVF 
352 

Search completed: Wed Nov 1 08:30:41 1995 
Job time : 35 sees. 


